WHAT IS CLAIMED IS: 



1. A method comprising: 

detecting that an application in a first node is to failover; 

provisioning a second node to execute the application responsive to the detecting; 
and 

failing the application over from the first node to the second node. 

2. The method as recited in claim 1 wherein the provisioning comprises activating one or 
more resources used by the application on the second node. 

3. The method as recited in claim 1 wherein the provisioning comprises installing one or 
more resources used by the application on the second node. 

4. The method as recited in claim 1 wherein the second node has multiple boot 
capability, and wherein the provisioning comprises rebooting the second node from a 
partition that comprises one or more resources used by the application. 

5. The method as recited in claim 1 further comprising selecting the second node from a 
plurality of nodes. 

6. The method as recited in claim 5 wherein the second node is executing a different 
application when selected. 

7. The method as recited in claim 5 wherein the selecting comprises verifying that the 
second node includes hardware that is sufficient to execute the application. 



8. The method as recited in claim 1 further comprising adding the first node to the 
plurality of nodes to be selectable for provisioning. 

5 9. The method as recited in claim 1 wherein the first node is included in a cluster being 
used to execute the application, and wherein the method further comprises adding the 
second node to the cluster. 

10. The method as recited in claim 1 wherein the detecting comprises detecting that the 
10 performance of the application executing on the first node is less than a threshold 

performance level. 

1 1 . The method as recited in claim 10 wherein the performance is less than the threshold 
performance level for at least a predefined time interval. 

15 

12. The method as recited in claim 10 wherein the detecting comprises alternatively 
detecting a failure in a service group including the application. 

13. The method as recited in claim 1 wherein the detecting comprises detecting a failure 
20 in a service group including the application. 

14. The method as recited in claim 1 further comprising detecting a lack of success in the 
failing over. 

25 15. The method as recited in claim 14 further comprising: 

provisioning a third node to execute the application responsive to detecting the 
lack of success; and 

20 



failing the application over from the second node to the third node. 

16. The method as recited in claim 14 wherein the lack of success is due to a lack of an 
eligible node, and wherein, if detecting that the application is to failover is due to a 

5 performance of the application on the first node being less than a threshold performance 
level, the method further comprises permitting the application to execute on the first 
node. 

17. The method as recited in claim 14 wherein the lack of success is due to a lack of an 
10 eligible node, and wherein, if detecting that the application is to failover is due to a failure 

in a service group including the application, the method further comprises notifying an 
administrator. 

18. The method as recited in claim 1 further comprising: 

15 

determining that a performance level on the second node is less than a threshold; 

provisioning a third node to execute the application responsive to the determining; 
and 

20 

failing the application over from the second node to the third node. 

19. A computer accessible medium encoded with instructions that, when executed: 

25 detect that an application in a first node is to failover; 

provision a second node to execute the application responsive to detecting that the 
application is to failover; and 
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failover the application from the first node to the second node. 



20. The computer accessible medium as recited in claim 19 wherein the instructions 
which, when executed, provision the second node comprise instructions which, when 

5 executed, activate one or more resources used by the application on the second node. 

21. The computer accessible medium as recited in claim 19 wherein the instructions 
which, when executed, provision the second node comprise instructions which, when 
executed, install one or more resources used by the application on the second node. 

10 

22. The computer accessible medium as recited in claim 19 wherein the second node has 
multiple boot capability, and wherein the instructions which, when executed, provision 
the second node comprise instructions which, when executed, reboot the second node 
from a partition that comprises one or more resources used by the application. 

15 

23. The computer accessible medium as recited in claim 19 wherein the instructions, 
when executed, select the second node from a plurality of nodes. 

24. The computer accessible medium as recited in claim 23 wherein the instructions 
20 which, when executed, select the second node comprise instructions which, when 

executed, verify that the second node includes hardware that is sufficient to execute the 
application. 

25. The computer accessible medium as recited in claim 23 wherein the instructions, 
25 when executed, add the first node to the plurality of nodes to be selectable for 

provisioning. 

26. The computer accessible medium as recited in claim 19 wherein the first node is 
included in a cluster being used to execute the application, and wherein the instructions, 
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when executed, add the second node to the cluster. 

27. The computer accessible medium as recited in claim 19 wherein the instructions 
which, when executed, detect that the application is to failover comprise instructions 

5 which, when executed, detect that the performance of the application executing on the 
first node is less than a threshold performance level. 

28. The computer accessible medium as recited in claim 27 wherein the performance is 
less than the threshold performance level for at least a predefined time interval. 

10 

29. The computer accessible medium as recited in claim 27 wherein the instructions 
which, when executed, detect that the application is to failover comprise instruction 
which, when executed, alternatively detect a failure in a service group including the 
application. 

15 

30. The computer accessible medium as recited in claim 19 wherein the instructions 
which, when executed, detect that the application is to failover comprise instruction 
which, when executed, detect a failure in a service group including the application. 

20 3 1 . A system comprising a plurality of nodes, wherein a first node of the plurality of 

nodes is configured to monitor performance of an application executing on a second node 
of the plurality of nodes during use, and wherein, in response to a detection that the 
application is to failover from the first node, a third node is configured to be provisioned 
to execute the application and wherein the application is failed over to the third node 

25 during use. 

32. The system as recited in claim 3 1 wherein provisioning the third node comprises 
activating one or more resources used by the application on the second node. 
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33. The system as recited in claim 31 wherein provisioning the third node comprises 
installing one or more resources used by the application on the third node. 



34. The system as recited in claim 3 1 wherein the third node has multiple boot capability, 
5 and wherein provisioning the third node comprises rebooting the third node from a 

partition that comprises one or more resources used by the application. 

35. The system as recited in claim 31 wherein the second node is included in a cluster 
being used to execute the application, and wherein the third node is added to the cluster. 

10 

36. The system as recited in claim 3 1 wherein the first node is configured to detect that 
the performance of the application executing on the second node is less than a threshold 
performance level. 

15 37. The system as recited in claim 36 wherein the performance is less than the threshold 
performance level for at least a predefined time interval. 

38. The system as recited in claim 31 wherein the second node is configured to detect a 
failure in a service group including the application, and wherein the application is to 
20 failover from the second node if the second node detects the failure. 
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